Mining and Validation of Localized Frequent Web Access Patterns with Dynamic Tolerance

نویسندگان

  • Olfa Nasraoui
  • Suchandra Goswami
چکیده

Mining user profiles is a crucial task for Web usage mining, and can be accomplished by mining frequent patterns. However, in the Web usage domain, sessions tend to be very sparse, and mining the right user profiles tends to be difficult. Either too few or too many profiles tend to be mined, partly because of problems in fixing support thresholds and intolerant matching. Also, in the Web usage mining domain, there is often a need for post-processing and validation of the results of mining. In this paper, we use criterion guided optimization to mine localized and error-tolerant transaction patterns, instead of using exact counting based method, and explore the effect of different post-processing options on their quality. Experiments with real Web transaction data are presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining and Validating Localized Frequent Itemsets with Dynamic Tolerance

We cast the frequent itemset mining problem as a criterion guided optimization problem instead of one based on exact counting. This opens several interesting possibilities, including modification of the criterion function to take into account (i) error tolerance, (ii) locality, (iii) unsupervised estimation of the error tolerance, and (iv) search strategy. We also propose a new validation proce...

متن کامل

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

Multidimensional Web Access Pattern Tree (MD-WAP Tree)

Mining frequent web access patterns from large data (web log) is one significant application of sequential pattern mining. Web access patterns are set of frequent sub sequences that are useful to know user behaviour in real time in order to make dynamic decisions. Techniques for extracting web access patterns from data available in two flavours: apriori based and non apriori based (tree based)....

متن کامل

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

  One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

متن کامل

FM-WAP Mining: In Search of Frequent Mutating Web Access Patterns from Historical Web Usage Data

Recently, a large amount of work has been done in web access pattern (WAP) mining. Most of the existing techniques focus on mining WAP that occur frequently from snapshot web usage data collection. However, web usage data is dynamic in real life. The dynamic nature of web usage data leads to two challenging problems. The first problem is the maintenance of existing WAP mining results. The secon...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006